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Tiered Web Site with Web-Tier Admission Control Based 
on Utilization Data Generated on Application Tier 

[0001] BACKGROUND OF THE INVENTION 

5 [0002] The present invention relates to computers and, more 
particularly, to "host" computer netvs^orks that respond to requests 
from extemal "cUent" computers. A major objective of the present 
invention is to enhance the quahty of service associated with a host 
site on the World Wide Web, 

10 [0003] Much of modem progress is associated with the 
development of computers and the Internet, which permits 
computers to communicate worldwide. Several protocols exist by 
which a host site, which may comprise one or more "server" 
computers, receives and processes messages from a number of 

1 5 other computers ("chents"). For example, HTTP (HyperText Transfer 
Protocol) is a communications protocol used on the World Wide Web 
to enable users to navigate within and among host sites. 

[0004] Messages can usually be grouped into sessions, with 
each session having one or more related messages. For example, a 

20 session can consist of a message requesting information over the 
World Wide Web, and an associated response. Altematively, a 
multiple-message session can consist of a commercial transaction, 
with related messages respectively used to locate within a web site a 
precise ' product, submit an order or billing and shipping 

25 information, and convey a confirmation of sale to a particular cUent. 
Whether a host is to process just a single message or a series of 

1 
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related messages, it is usually important to quickly, accurately, and 
completely service each message and each session. 

[0005] The term "quahty of service" refers to a host's abiUty to 
respond quickly to individual messages and to complete sessions, 
5 As a particular host becomes more popular, and therefore receives 
more messages, the host's processing resoiurces can become 
stretched. For example, due to heavy traffic, a host may not be able 
to respond to a message at all, or the host may not provide a timely 
response (which can cause a cUent to "time-out" with an error or the 
1 0 impatient user to resend the message). Poor quahty of service can 
have significant impact, as users may become frustrated and simply 
give up trying to reach a particular host, and the sponsor of the host 
may lose sales or fail to communicate needed information to some 
chents. 

1 5 [0006] Quahty of service can be improved by adding 
processing capacity and by implementing admissions control. 
Different approaches to increasing capacity are discussed further 
below. In many cases, it is cost effective to provide sufficient 
capacity to handle all messages most of the time, while relying on 

20 admissions control for peak demand situations. Even where 
additional hardware resources can be added on demand, peak usage 
can occin too suddenly for the additional capacity to be operational; 
the time from the spike being identified to the time additional 
capacity is available can be hours to days for manual operations and 

2 5 many minutes to hours for automatic or semi-automatic operations. 
During this time, admission control is often the best alternative to 
handling an immediate spike in usage, especially when the spike is 
short Uved and far higher than the site's demand growth trend. 
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[0007] Admissions control involves selectively admitting and 
rejecting messages so that the messages that are processed are 
handled promptly and rehably. InteUigent admissions control can 
prioritize messages to advance the purposes of the host site. For 
5 example, messages associated with ongoing sessions can be given 
priority. U.S. Patent 6,006,269 to Phaal discloses a system in w^hich 
admission determinations are made on a session basis and in which 
non-admitted messages are deferred with a higher priority level 
rather than rejected outright. Thus, opportunities associated with 
1 0 non-admitted messages are not necessarily lost. 

[0008] Preferably, the admissions decisions are based on 
utihzation data gathered by the host site. All messages can be 
admitted when resource utihzation is low. As it increases to 
potentially problematic levels, admissions can become increasing 
1 5 selective. However, since there is a potential detriment to the site 
whenever admissions control rejects or defers a message, admission 
control is not desirable on a steady-state basis. It is preferable that 
host-site capacity be able to process all messages most of the time 

[0009] In general, quahty of service can be improved by adding 
20 processing power to the host site. For example, in a single-server 
site, a less powerful server can be replaced with a more powerful 
server. An advantage of the single-server replacement approach is 
simphcity. A single computer handles all site functions. Programs 
running on that computer can handle message, monitor resources, 
25 and administer admissions control. 

[0010] On the other hand, there are limits to the single-server 
approach. At the high end, where the limits of available technology 
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are pushed, fractional increases in power are quite costly. 
Furthermore, when the single server is dovm, the entire site is dovm, 
Also, a replacement strategy can be wasteful if the replaced server is 
no longer used. In practice, most large sites use multiple servers- 

5 [0011] Similarly configured severs can be arranged in parallel. 
Servers can be added in parallel as needed to increase capacity. 
A load-balancing mechanism can be added at the front end to 
distribute messages among the parallel servers. However, the 
parallel servers need to be coordinated to ensxire session integrity. 

1 0 Also, while admissions control can be done independently by each 
server, this results in messages being rejected by one server while 
there is ample capacity on another. It is not practical for the 
parallel servers to commimicate with each other regarding each 
chent request. While . the coordination can, in principle, be 

1 5 performed at the load balancer, imposing additional processing 
requirements on the common node for aU messages can result in an 
unacceptable performance bottleneck. 

[0012] A tiered host site overcomes many of the problems 
facing a multi-server host site. A typical tiered host site has a 

20 cUent-response tier (e.g., web tier using HTTP protocol) and an 
apphcation tier. The appUcation tier performs most of the resource- 
intensive work regarding the purpose of the site, while chent- 
response tier typically provides the chent interface. For example, 
the cUent-response tier can provide a web interface for a chent, 

25 while the apphcation tier handles e-commerce (electronic 
commerce) apphcations and maintains a database accessed by the 
apphcations. The appUcation tier can manage sessions, tagging 
responses so that subsequent messages in the session can be 
identified as such. The chent-response tier can use the session tags 
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to generate cookies and/or links associated with session-specific 
URLs to direct subsequent client requests to the proper appUcation 
server, 

[0013] Each tier can be expanded independently as required. 
5 Since the chent-response tier manages the distribution of messages 
to the appUcation tier, appUcation tier servers can be differentiated 
by function. Capacity can be increased by activating (adding, 
tinning on, or re-aUocating) a server dedicated to the stressed 
function so that the additional resources are not wasted on 
1 0 underutiUzed appUcations, 

[00141 The cUent-response tier can utiUze paraUel similarly 
configured servers. Coordination among the paraUel servers is 
faciUtated by the session tags added by the appUcation tier. 
Expansion of the cUent-tier capacity is then readily achieved by 

1 5 adding additional servers in paraUel. Alternatively, the cUent- 
response tier can be configured as a load-balancing hub supported 
by other servers dedicated to specific cUent-response functions, 
such as encryption and decryption. WhUe the hub approach does 
place additional burdens on the common-node load balancer, 

20 session tracking is stiU managed primarily at the appUcation tier. A 
hubbed cUent-response tier can be expanded efficiently by adding 
servers dedicated to a specific function that is over utiUzed. 

[0015] In a tiered site, the admissions control function is 
typicaUy assigned to the cUent-response tier since it provides the 
25 front-end interface to the cUent computer. The admission control 
function can monitor local resource utiUzation effectively on a per- 
server basis. The admission control function on the cUent-response 
tier can monitor response times associated with requests to the 
appUcation tier as a measure of its resource utiUzation. The 
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resource utilization information about the client-response tier and 
the apphcation tier then is used to determine the admissions 
control poUcy at any given time. 

[0016] While the tiered host site apDroache approach provides 
5 for efficient scaling and for effective admissions control, there is an 
insatiable demand for better performance. In particular, there is a 
demand for better admission control, since it is a software 
component that can, in principle, be upgraded less expensively than 
the host site hardware. What is needed is more effective admission 
1 0 control for a tiered host site. 

[0017] SUMMARY OF THE INVENTION 

[0018] The present invention provides a networlc host site in 
which admission control determinations are made on a chent- 
response tier using resource utOization data collected on an 

1 5 apphcation tier. The invention further provides for admissions- 
control determinations that are also made as a fimction of resource- 
utihzation data collected on the cUent-r espouse tier. In addition to 
being used for admissions determinations, the resource utihzation 
data can be used to request an increase or decrease of active server 

20 resources. 

[0019] The host site has distinct chent-response and 
apphcation tiers in that each host site host-site server is assigned to 
at most one of the tiers. Servers that only handle chent interfacing 
are assigned to the chent-response tier, while servers that run non- 
25 interfacing apphcations but do not handle chent interfacing are 
assigned to the apphcation tier. Servers that handle both non- 
interfacing apphcations and chent interfacing are assigned to the 
chent-response tier. 
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[0020] The invention provides for a variety of tier 
configurations- The client-response tier can be configured as a 
single server, as a set of parallel servers with a load-balancing front 
end, or as a set of speciahzed servers managed by a routing hub, 
5 The appUcation tier can have sub-tiers; for example, speciahzed 
apphcation servers can be serviced by one or more database servers 
on a back end. Alternatively, the apphcation tier can be arranged 
around a routing hub, 

[0021] While appUcation-tier performance can be assessed by 
10 the chent-response tier, local apphcation-tier resource monitoring 
has several advantages. First, the apphcation server as a whole can 
be measured independently of the network and even other servers 
(such as the database). Also it allows for a disaggregated measure 
of an apphcation' s performance by resource components (for 
1 5 example, memory, CPU, I/O). Moreover, it allows each apphcation to 
associate measurements with its own admissions classes and base 
them on session or even chent identity. 

[0022] Finally, resource monitoring can be customized on a 
per-server basis. Where multiple apphcations are running on the 

20 same server, customization can be implemented on a per- 
apphcation basis. This allows for better optimization of apphcation 
resources. For example, the apphcation tier can generate resource 
utihzation data in XML ("Extensible Markup Language) format with 
instructions to the chent-response tier regarding interpretation and 

25 usage of the apphcation-tier resource utihzation data. These and 
other features and advantages of the present invention are apparent 
from the description below with reference to the foUowing 
drawings- 
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[0023] BRIEF DESCRIPTION OF THE DRAWINGS 

[0024] FIGURE 1 is a schematic diagram of a liost site with 
parallel chent-response-tier servers and comiected to multiple 
cUents in accordance v^th the present invention. 

5 [0025] FIGURE 2 is a schematic diagram of the host site of FIG. 
1 indicating internal communications paths and scalabihty options. 

[0026] FIGURE 3 is a flow chart of a method of the invention 
implemented on the host site of FIG. 1. 

[0027] FIGURE 4 is a schematic diagram of a host site with a 
1 0 hub - based chent - responsc tier and connected to multipl e ch e nts In 
accordance with the pres e nt invention. A n altemative embodiment is 
described but not illustrated. 

[0028] DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0029] In accordance with the present invention, a host site 
15 API communicates with chent computers 11 over the Intemet, 
indicated by a connection 13, as shown in FIG. 1. In this case, host 
site API is an e-commerce site, and more specifically a flight- 
scheduhng and reservation-handling site. More generally, the 
invention provides for host sites with a wide range of purposes. 
20 The host site is divisible into an apphcation tier 15 and a web 
tier 17. In FIG. 1, chent computers are depicted as part of a cUent 
tier 19. This is a conceptuahzation from the perspective of the host 
site and does not imply any special relationship among the chent 
computers other than the abiUty to communicate with host site API. 

25 [0030] The host-site hardware is shown in greater detail in 
FIG. 2. Web tier 17 has three active servers WSl, WS2. WS3, and a 
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load balancer 21. In periods of high demand, host site API can 
activate up to three additional servers WS4, WS5, and WS6- 
Application tier 15 includes a database server ASl and two front- 
end application servers AS 2 and AS 3, while an additional 
5 apphcation server AS4 can be activated when additional resources 
are required. Servers WS4, WS5, WS6, and AS4 are normally inactive 
or used elsewhere in order to save costs. The relatively greater 
expandabihty of the web tier (indicated by having three servers in 
reserve versus one for the apphcation tier) corresponds to the 
1 0 greater computing power and state maintenance normally required 
of the apphcation servers, and thus the greater expense involved In 
adding servers to the apphcation tier. 

[00311 Apphcation-tier servers, may, but need not, run the 
same software. For example, apphcation server ASl runs a database 

1 5 program that organizes information regarding flight schedules. 
Apphcation server AS 2 runs a flight scheduling apphcation, while 
apphcation server AS 3 runs a fhght reservation apphcation. In 
times of high demand, apphcation server AS4 can provide additional 
capacity for the flight-reservation function. Moreover, each 

20 apphcation server can run multiple apphcations, some of which may 
run on only one apphcation server, some of which may run on aU 
apphcation servers, and some of which may run on more than one 
but not aU apphcation servers. However, the apphcation servers do 
not run the web-serving apphcations that interface directly with 

2 5 chent tier 19. 

[0032] Web servers WSl, WS2, and WS3, as weU as the other 
web servers when active, run respective copies of the same 
software. Each runs a web serving apphcation that provides the 
main interface to browsers running on chent computers 11. Load 
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balancer 21 distributes client requests in round-robin fashion to the 
active web servers. In addition, each server can run one or more 
appUcation programs other than web servers. However, the more 
complex application programs are run on the appUcation-tier 
5 servers. 

[00331 The common functions of servers on web tier 17 are 
shown in FIG. 1. Client requests are received at an admission 
control gateway 23- Gateway 23 initially determines whether or not 
the request is encrypted. If it is, it is decrypted at encrypt/decrypt 

10 function 25. Once the request is decry^ed or if it was not 
encrypted, admission control gateway 23 determines whether to 
pass it on to (software) chent-request processor 27, or to deferral 
manager 29. The decision is made as a fimction of the 
programming of admission control gateway 23 by deferral 

1 5 manager 29. 

[0034] In the event the request is passed to it, client-request 
processor 27 determines whether it can respond to the request 
using local (web- tier-server) resources, or' whether it needs to 
forward the request to appUcation tier 15. In the former case, a 

20 cUent response can be generated in accordance with the cUent 
browser protocol (e.g., HTTP). The cUent response is passed to 
admissions control gateway 23. If encryption is required, 
admissions control gateway 23 passes the cUent-response to 
encrypt/decrypt fimction ^7 -25 and forwards the encrypted result 

25 to the cUent that made the request. 

[0035] If the client request cannot be met locally, client 
request chent-request processor ^5 —27 formulates an appUcation 
request for an appUcation processor 31 in the form of software 
running on appUcation tier 15. Since not aU servers of appUcation 

10 
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tier 15 run the same applications, client-request processor £5-27 
must direct the appUcation request to the appropriate server and 
appUcation, If that target appUcation is running on more than one 
appUcation-tier server, cUent-request processor £5 — 27 m ust 
5 determine which of those servers to direct the appUcation request 
to. GeneraUy, requests associated with the same session are 
directed to the same appUcation server. Otherwise, a selection can 
be made based on unused capacity, or more simply on a round- 
robin basis. 

1 0 [0036] Some appUcation requests can require processing on 
more than one server. For example, a request for flight-schedule 
information may require a flight-schedule appUcation on server AS2 
(FIG. 2) to access database server ASl. However many appUcation 
servers are involved, an appUcation response is generated at 

1 5 appUcation tier 1 5 in response to the appUcation request. The 
appUcation tier can tag the appUcation response with session 
identification information so that subsequent cUent requests in the 
same session can be correlated. 

[0037] The appUcation response is forwarded from appUcation 
20 processor 31 to the cUent-request processor 2527. TypicaUy, the 
appUcation response is transmitted to the web-tier server that 
generated the corresponding appUcation request. Altematively, the 
selection of web-tier server for the appUcation response can be 
selected on some other basis, e.g., round robin. Once cUent-request 
25 processor £5 —27 r eceives an appUcation response, it generates a 
cUent response, treatmg it in the manner described above for 
locaUy - answered l ocaUv answered cUent requests. 

[0038] CUent-request processor £5 —27 can use the session 
identification information to generate a cookie or a imique URL. 
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This URL can be communicated in the form of a web page Unk that 
generates a new chent request with the session information 
embedded- This enables a web-tier server to route a subsequent 
message in a session to the appropriate appUcation-tier server. 

5 [0039] Each apphcation-tier server has a resource monitor 33. 
The parameters 35 to be monitored can be selected, but typically 
CPU usage is one parameter that is monitored. The parameter 
values determined through this monitoring are forwarded from the 
resource monitor to the deferral manager 2729. The resource 
10 monitor 33 of each appUcation server forwards this data to every 
web-tier deferral manager £729. The deferral manager ^7-29 uses 
this information to determine the response of gateway 23 to chent 
requests. 

[00401 Conciuxently, the resource parameters associated with 
1 5 the web tier can be monitored. To this end, a web-tier resource 
monitor 37 on each web-tier server monitors usage including chent- 
request processing and encryption and decryption activity. Note, 
the web-tier resource utihzation data for each web-tier server can be 
provided to load balancer 21 (FIG. 2) so that it can take web server 
20 utihzation data into account when distributing chent requests. 

[0041] Depending on the programming, deferral manager 29 
can command gateway 23, in the alternative to admit all requests, 
admit no requests, admit only requests associated with on-going 
sessions, or admit requests according to some more complex 
25 criteria. Non-admitted requests can be forwarded to the deferral 
manager. The deferral manager can assign a time for a chent to 
resubmit the request. The deferral message is tagged so that a 
request in compUance with the deferral message is given 
preferential treatment at gateway 23. For example, resubmitted 

12 
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requests can be admitted while new requests are deferred. The 
deferral time can be set as a function of resource utilization- In 
addition to controlling admission criteria, the deferral manager can 
send a request to a resource manager on the appUcation tier to 
5 increase (or decrease) available resources. 

[0042] A method Ml of the invention practiced in the context 
of host site API is flow charted in FIG. 3. Method Ml includes two 
processes, a request-handling process PI and a resource-utlhzation 
process P2. Request-handling process involves chent tier 19, and 
1 0 the two host-site tiers: web tier 17, and appUcation tier 15. 

[0043] An illustrative sequence for request process PI begins 
with a client sending a request at step SI. Load balancer 21 (FIG. 2) 
receives this request and allots it on a round robin basis, taking into 
accoimt web-tier resource-utilization data, at step S2 to a web-tier 
15 server, e.g., server WSl. The gateway for the receiving server can 
either admit or defer this request. In this illustrative sequence, the 
request is deferred at step S3. In this case, deferral manager 29 
generates and transmits to the chent a deferral message indicating 
when the cUent can resubmit the request. 

20 [0044] The chent resubmits the request by cUcking a link in the 
deferral message at step S4. Load balancer distributes this 2''*^ 
request to a web-tier server at a second iteration of step S2. In 
general, the deferred request is admitted at step S5. 

[0045] Unless the chent-request processor of the web-tier can 
25 respond on its own, it generates an apphcation request and 
transmits it to the apphcation tier at step S6. Each web-tier server 
Icnows which apphcations run on which apphcation-tier severs, so 
the request is directed to an appropriate apphcation-tier server. 

13 
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The receiving server generates an application response at step S7. 
The receiving server may access other apphcation tier servers, 
including a database server, as necessary to formulate this 
apphcation response. The apphcation tier can tag the apphcation 
5 response with session-identification information to keep track of 
sessions. Finally, the chent-request processor generates a chent 
response, using the session identification to generate a cookie or a 
unique URL that can be used in routing subsequent messages in a 
session. 

1 0 [0046] The chent response S8 may result is-in a further chent 
request. Assuming this is a continuation of a session, the session is 
identified in the new request. This session information is ignored 
by the load balancer, but used by the selected web-tier server to 
direct a new apphcation request to the appropriate apphcation tier 

1 5 server. 

[0047] Resource process P2 is concurrent with multiple 
iterations of steps S1-S8- At step Rl, apphcation tier resources are 
monitored at the apphcation tier. For example, each apphcation tier 
server can monitor its own resoxirces. The monitor generates usage 
20 data at step R2. This data is transferred to the web tier and used to 
generate deferral criteria at step S^R3. The deferral criteria are 
used at the web tier to determine which requests are admitted (at 
step S5) and which steps are deferred (at step S3R3). 

[0048] In host site API, the web tier is arranged with servers 
2 5 operating in parallel. The invention provides for alternatives to the 
paraUel web-tier arrangement. For example, the web tier can be 
implemented on a single server. Furthermore, a multi-server web 
tier can differentiate servers by function, for example, as in an 
alternative host site svstem AP2 of FIG. 4 . 
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[0049] Host - site AP2 T he alternative host-site interfaces v^th a 
an alternative client tier 419 , which includes a number of cUent 
computers-4ii. Host - sit e AP2 T he alternative host site includes an 
alternative apphcation tier 415 and a n alternative web tier 417 . 
5 Apphcation T he alternative apphcation tiei>41:5. which is similar to 
apphcation tier 15 of host site API , includes an altemative 
apphcation processing function-431r, a n altemative apphcation-tier 
resource monitoi^-435, and altemative r esource parameters— 435. 
However, the altemative w eb tier 417 of the altemative h ost site 
1 0 AP2 is quite distinct from the parallel arrangement of web tier 17 of 
host site API. 

[0050] Web -Altemative web tier 417 comprises six servers-4?^ 
425, 42G, 427, 428, and 429 . Serv-crs 423, 425, 427 and 429 Fourof 
these servers are dedicated to specific functions that were shared by 

1 5 aU servers in web tier 17 of host site API. Server 426 One of the 

other two altemative servers p rovides additional processing for the 
encryption/decryption function. Qu e ue The other altemative server 
is a queue m anager 4g ft that adds a function of queuing chent and 
apphcation requests for serial routing by the altemative admissions 

2 0 control gateway-4£3. 

[0051] Admission The altemative admissions control gateway 
423 of riG, 4 performs the function of admission control gateway 
23 and the load-balancing function of load balancer 21 of FIG. 2. It 
essentially acts as the load-balancing hub for the altemative w eb 
2 5 tier 417 . It determines whether encryption or decryption is required 
and directs messages to one of the altemative 
encryption/decryption servers 425, 426 , as required. It implements 
deferral pohcy as directed by the altemative deferral manager-429. 
It directs appropriate chent requests to chent rcqucst chent-reauest 

15 
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processoi--^27. Client and application requests are referred to the 
queue managei"-*29 for ordered processing. The one function that 
is not routed through the alternative g ateway 423 is the resource 
monitoring function, in w^hich resource utilization data is 
5 transmitted directly to the alternative deferral manager server 429. 

[0052] An advantage to the v^eb-tier hub approach of the 
alternative host site AP2 i s that scaling can be done fimction by 
function, as it has been with encryption and decryption. Functions 
that are not heavily utilized need not be expanded to new servers. 
1 0 On the other hand, functions can be combined on a server to avoid 
imderutihzation of hardware. 

[0053] The present invention has appUcabiUty to network 
communications and e-commerce. The invention is scaleable to any 
number of servers and provides for a wide variety of tier 
1 5 architectures with varying numbers of subtiers. These and other 
modifications to and variations upon the present invention are 
provided for by the present invention, the scope of which is defined 
by the following claims. 

[0054] What Is Claimed Is : 
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CLAIMS 



1 1- A host site comprising: 

2 an application tier that generates resource utilization data; and 

3 a cUent-response tier that performs admission control using said 

4 resource utilization data. 

1 2- A host site as recited in Claim 1 wherein: 

2 said chent-response tier has 

3 an admissions control gateway for receiving client requests 

4 from chent computers; 

5 a deferral manager for making determinations concerning 

6 which of said chent requests should be admitted for 

7 processing at least in part as a function of apphcation- 

8 resource-utihzation data; 

9 a cUent-request processor for processing chent requests 

10 admitted for processing, said chent-request processor 

1 1 generating apphcation requests in response to at least some 

1 2 of said chent requests, said chent-request processor 

1 3 generating chent responses in response to apphcation 

14 responses received in response to said apphcation requests; 

15 and 

1 6 said apphcation tier has 

1 7 at least one apphcation-tier server not in said chent- 
1 8 response tier, 

1 9 an apphcation processing function running on said server 

20 for responding to said apphcation requests with said 

2 1 apphcation responses, and 

22 a resource monitor for generating said apphcation- 

23 resource-utihzation data by monitoring utihzation parameters 

24 on said server, said resource monitor commimicating said 

2 5 resource-utihzation to said deferral manager. 



17 



us SN 09/91 6.1 01 Substitute Specification— Marked Version 
1 0006843-1 (Harkin et al.) H0009P41 R-marked.doc PATENT 



1 3. A system as recited in Claim 2 wherein said client-request 

2 processor transmits said client responses to said client computers 

3 in accordance with an http protocoL 

1 4. A system as recited in Claim 2 wherein said cUent-response 

2 tier includes at least one chent-response-tier server on which said 

3 deferral manager and said cUent-request processor run, said chent- 

4 response-tier also having a chent-response-tier resource monitor for 

5 generating chent-response-tier resource utihzation data as a 

6 function of parameters associated with said chent-response-tier 

7 server, said chent-response-tier resource monitor providing said 

8 chent-response-tier resource utihzation data to said deferral 

9 manager, said deferral manager making said determinations in part 
10 as a function of said chent-response-tier resource utihzation data. 

1 5- A system as recited in Claim 4 wherein said deferral manager 

2 also generates requests to expand the processing capacity on said 

3 chent-response tier, 

1 6. A system as recited in Claim 2 wherein said deferral manager 

2 generates requests to expand the processing capacity on said 

3 apphcation tier. 

1 7- A system as recited in Claim 6 wherein said processing 

2 capacity is expanded by one or more of the foUowing: increasing the 

3 nimiber of servers, increasing the number of processors, increasing 

4 CPU entitlement. 
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1 8- A network hosting method for a host site having an 

2 appUcation tier and a cUent-response tier, said appUcation tier 

3 having at least one server not included in said cUent-response tier, 

4 said method comprising a client-response process and a resource- 

5 monitoring process, 

6 said resom'ce-monitoring process involving 

7 al) monitoring resom-ce utiUzation at said appUcation tier, 

8 a2) generating resom'ce-utihzation data at said application 

9 tier and transmitting said resom^ce-utilization data to said 

1 0 chent-response tier, and 

1 1 said request processing involving 

1 2 bl) receiving at said cUent-response-tier a cUent request 

1 3 from a cUent via a network, 

14 b2) making an admissions determination at said cUent- 

1 5 response tier whether said cUent request should be admitted 

16 immediately or deferred, said determination being made at 

1 7 least in part as a function of said resource-utiUzation data, 

1 8 b3) when said client request is admitted, sending an 

1 9 application request from said cUent-response tier to said 

20 appUcation tier, 

21 b4) generating an appUcation response at said appUcation 

22 tier and transmitting said appUcation response to said cUent- 

23 response tier, and 

24 b5) generating a cUent response as a fimction of said 

2 5 appUcation response and transmitting said cUent response to 
26 said cUent via said network. 
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1 9. A method as recited in Claim 8 wherein at step b2) in the 

2 event a determination is made to defer admission of said cUent 

3 request, a notification is sent to said chent to that effect via said 

4 network. 

1 10. A method as recited in Claim 8 wherein at step b2, in the 

2 event a determination is made to defer admission, a request is 

3 generated to increase the processing capacity on said apphcation 

4 tier. 

1 11. A method as recited in Claim 8 wherein at step b2) said 

2 determination is also a function of resource-utiUzation data 

3 generated at said chent-response tier. 

1 12. A host site method comprising: 

2 generating resource-utilization data on an apphcation tier; and 

3 using said data on a chent-response tier to determine admission- 

4 control pohcy. 
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ABSTRACT OF THE DISCLOSURE 

An e-commerce web site has an application tier and a web 
tier, each with multiple servers. The application tier runs the 
resource-intensive functions germane to the site's piupose; while 
5 the web tier provides the primary interface for cUents. The web tier 
implements admission control so that some requests can be 
deferred during periods of heavy traffic. In addition, the web tier 
can request additional web-tier and apphcation-tier resources when 
they can be added cost effectively. The apphcation tier monitors its 

1 0 own resource utilization and communicates resource-utOization 
data to the web tier. The web tier uses this appUcation-tier 
resource-utOization data to determine its deferral criteria. In 
addition, the web-tier can monitor its ovm resource utiUzation and 
take the results into account in determining the deferral criteria. By 

1 5 monitoring apphcation-tier utihzation at the apphcation tier rather 
than remotely at the web tier, more accurate and more flexible 
information is obtained, allowing for a better-optimized deferral 
scheme. 
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